Identifying Algebraic Properties to Support Optimization of Unary Similarity Queries

نویسندگان

  • Mônica Ribeiro Porto Ferreira
  • Agma J. M. Traina
  • Ires Dias
  • Richard Chbeir
  • Caetano Traina
چکیده

Conventional operators for data retrieval are either based on exact matching or on total order relationship among elements. Neither of them is appropriate to manage complex data, such as multimedia data, time series and genetic sequences. In fact, the most meaningful way to compare complex data is by similarity. However, the Relational Algebra, employed in the Relational Database Management Systems (RDBMS), cannot express similarity criteria. In order to address this issue, we provide here an extension of the Relational Algebra, aimed at representing similarity queries in algebraic expressions. This paper identifies fundamental properties to allow the integration of the unary similarity operators into the Relational Algebra to handle similarity-based operators, either alone or combined with the existing (exact matching and/or relational) operators. We also show how to take advantage of such properties to optimize similarity queries, including these properties into a similarity query optimizer developed for a Similarity Retrieval Engine, which uses an existing RDBMS to answer similarity queries.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Optimizing similarity queries in metric spaces meeting user's expectation

The complexity of data stored in large databases has increased at very fast paces. Hence, operations more elaborated than traditional queries are essential in order to extract all required information from the database. Therefore, the interest of the database community in similarity search has increased significantly. Two of the well-known types of similarity search are the Range (Rq) and the k...

متن کامل

Geometric Polynomial Constraints in Higher-Order Graph Matching

Correspondence is a ubiquitous problem in computer vision and graph matching has been a natural way to formalize correspondence as an optimization problem. Recently, graph matching solvers have included higher-order terms representing affinities beyond the unary and pairwise level. Such higher-order terms have a particular appeal for geometric constraints that include three or more corresponden...

متن کامل

Learning Unary Automata

We determine the complexity of learning problems for unary regular languages. We begin by investigating the minimum consistent dfa (resp. nfa) problem which is known not to be approximable within any polynomial, unless P = NP . For the case of unary dfa’s, we exhibit an efficient algorithm. On the other hand, we show the intractability of the unary minimum consistent nfa problem but provide an ...

متن کامل

Unnesting and Optimization Techniques for Extended-sql Queries Containing Generalized Quantiiers

Relational database systems do not eeectively support complex queries containing quantiiers (quanti-ed queries). Quantiied queries are becoming increasingly important in decision support applications in general, and health-care information systems in particular. Recently, it has been shown that generalized quantiiers provide an eeective way of expressing such queries naturally, and that general...

متن کامل

Optimization of Systems of Algebraic Equations for Evaluating Datalog Queries

A Datalog program can be translated into a system of fixpoint equations of relational algebra; this paper studies how such a system can be solved and optimized for a particular query. The paper presents a structured approach to optimization, by identifying several optimization steps and by studying solution methods for each step.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009